Variable Dimension Vector Quantization of Speech Spectra for Low Rate Vocoders
نویسندگان
چکیده
Optimal vector quantization of variable-dimension vectors in principle is feasible by using a set of fixed dimension VQ codebooks. However, for typical applications, such a multi-codebook approach demands a grossly excessive and impractical storage and computational complexity. Efficient quantization of such variable-dimension spectral shape vectors is the most challenging and difficult encoding task required in an important family of low bit-rate vocoders. We introduce a simple and effective formulation of variable-dimension vector quantization (VDVQ) which quantizes variable-dimension vectors using a single universal codebook having fixed dimension yet covering the entire range of input vector dimensions under consideration. This VDVQ technique is applied to quantize variable-dimension spectral shape vectors leading to a high quality speech coder at the low bit-rate of 2.5 kb/s. The combination of a universal spectral codebook and structured VQ reduces storage and computational complexity, yet delivers a high quantization efficiency and enhanced perceptual quality of the coded speech.
منابع مشابه
Using FFI Interpolator and VQ Quantization for Designing of High Quality 1200 BPS Speech Vocoder
Storaging or transmission of speech signals at very low bit rate is a hot area in the field of speech processing. We used stochastic inter-frame interpolators and vector quantization (VQ) as a new method for developing a high quality 1200 BPS speech vocoder. The objective and subjecgtive test results show that performance of the new vocoder is compairable with 4800 BPS standard vocoders (as CELP).
متن کاملUsing FFI Interpolator and VQ Quantization for Designing of High Quality 1200 BPS Speech Vocoder
Storaging or transmission of speech signals at very low bit rate is a hot area in the field of speech processing. We used stochastic inter-frame interpolators and vector quantization (VQ) as a new method for developing a high quality 1200 BPS speech vocoder. The objective and subjecgtive test results show that performance of the new vocoder is compairable with 4800 BPS standard vocoders (as CELP).
متن کاملA Novel Dimension Conversion for the Quantization of SEW in Wideband WI Speech Coding
The waveform interpolation is one of the speech coding algorithms with high quality at low bit rates. In the WI coding, the vector quantization of SEW requires a variable dimension quantization technique since the dimension of the SEW amplitude spectrum varies depending on the pitch period. However, since the variable dimension vector makes a difficulty to employ conventional vector quantizatio...
متن کاملNatural quality variable-rate spectral speech coding below 3.0 kbps
We propose new techniques for natural quality variable rate spectral speech coding at an average rate of 2.2 kbps for dialog speech and 2.8 kbps for monolog speech. The coder models the Fourier spectrum of each frame and it builds on recent enhancements to the classical multiband excitation (MBE) approach. New techniques for robust pitch estimation and tracking, for e cient quantization of voic...
متن کاملNovel low-band phase representation for low bit-rate speech coding
Vector Quantization (VQ) has been extensively used in speech vocoders. Phase information is often ignored or coarsely represented in parametric coders because of the difficulties facing phase quantization. This paper introduces a novel distortion measure for the low-band speech signal that takes phase information into consideration, with no increase in the bit-rate. This measure has been used i...
متن کامل